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Abstract 

Traditional measures of diversity, namely the number of species as well as Simpson's and Shannon's indices, are particular 
cases of Tsallis entropy. Entropy decomposition, i.e. decomposing gamma entropy into alpha and beta components, has 
been previously derived in the literature. We propose a generalization of the additive decomposition of Shannon entropy 
applied to Tsallis entropy. We obtain a self-contained definition of beta entropy as the information gain brought by the 
knowledge of each community composition. We propose a correction of the estimation bias allowing to estimate alpha, 
beta and gamma entropy from the data and eventually convert them into true diversity. We advocate additive 
decomposition in complement of multiplicative partitioning to allow robust estimation of biodiversity. 
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Introduction 

Diversity partitioning means that, in a given area, the gamma 
diversity D y of all individuals found may be split into within (alpha 
diversity, D x ) and between (beta diversity, Dp) local assemblages. 
Alpha diversity reflects the diversity of individuals in local 
assemblages whereas beta diversity reflects the diversity of the 
local assemblages. The latter, Dp, is commonly derived from D a 
and Dy estimates [1]. Recently, a prolific literature has emerged on 
the problem of diversity partitioning, because it addresses the issue 
of quantifying biodiversity at large scale. Jost's push [2-5] has 
helped to clarify the concepts behind diversity partitioning but 
mutually exclusive viewpoints have been supported, in particular 
in a forum organized by Ellison [6] in Ecology. A recent synthesis by 
Chao et al. [7] wraps up the debate and attempts to reach a 
consensus. Traditional measures of diversity, namely the number 
of species as well as Simpson's and Shannon's indices, are all 
special cases of the Tsallis entropy [8,9]. The additive decompo- 
sition [10] of these diversity measures does not provide indepen- 
dent components but Jost [3] derived a non-additive partitioning 
of entropy which does. 

A rigorous vocabulary is necessary to avoid confusion. Unrelated 
or independent (sensu [7]) means that the range of values oi q Dp is 
not constrained by the value of q D a , which is a desirable property. 
Unrelated is more pertinent than independent since diversity is not a 
random variable here, but independent is widely used, by [3] for 
example. We will write independent throughout the paper for 
convenience. We will write partitioning only when independent 
components are obtained and decomposition in other cases. 

Tsallis entropy can be easily transformed into Hill numbers 
[11]. Jost [3] called Hill numbers true diversity because they are 
homogeneous to a number of species and have a variety of 
desirable properties that will be recalled below. We will call diversity 



true diversity only, and entropy Simpson and Shannon indices as 
well as Tsallis entropy. The multiplicative partitioning of true y 
diversity allows obtaining independent values of a and /? diversity 
when local assemblages are equally weighted. 

However, we believe that the additive decomposition of entropy 
still has something to tell us. In this paper, we bring out an 
appropriate mathematical framework that allows us to write Tsallis 
entropy decomposition. We show its mathematical equivalence to 
the multiplicative partition of diversity. This is simply a 
generalization of the special case of Shannon diversity [12]. Doing 
so, we establish a self-contained (i.e. it does not rely on the 
definitions of a and y entropies) definition of /? entropy, showing it 
is a generalized Jensen-Shannon divergence, i.e the average 
generalized Kullback-Leibler divergence [13] between local 
assemblages and their average distribution. Beyond clarifying 
and making explicit some concepts, we acknowledge that this 
decomposition framework largely benefits from a consistent 
literature in statistical physics. In particular, we rely on it to 
propose bias corrections that can be applied to Tsallis entropy in 
general. After bias correction, conversion of entropy into true 
diversity provides independent, easy-to-interpret components of 
diversity. Our findings complete the well-established non-additive 
(also called pseudo-additive) partitioning of Tsallis entropy. We 
detail their differences all along the paper. 

Methods 

Consider a meta-community partitioned into several local 
communities (let i = l, 2, denote them), individuals are 
sampled in community i. Let S = 1,2, ... ,S denote the species that 
compose the meta-community, n S j the number of individuals of 
species s sampled in the local community i, n s = n s i the total 
number of individuals of species s,n= Z~2i n si me total number 
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of sampled individuals. Within each community i, the probability 
Psi for an individual to belong to species S is estimated by 
Psi = n si/ n i- The same probability for the meta-community is p s . 
Communities may have a weight,vt',-, satisfying p s = Wjp s {. The 
commonly-used w,- = n,/w is a possible weight, but the weighting 
may be arbitrary (e.g. the sampled areas). 

We now define precisely entropy. Given a probability distribu- 
tion p s = {p\',P2', ■ ■ ■ \Ps\ ■ ■ ■ 'iPs}, we choose an information 
function T(p s ), which is a decreasing function of p s having the 
property X(1)=0: information is much lower when a frequent 
species is found. Entropy is defined as the average amount of 
information obtained when an individual is sampled [14]: 

H=J2p s l(p s ) (1) 

S 



a and y diversity values are given by Hill numbers q D, called 
"numbers equivalent" or "effective number of species", i.e. the 
number of equally-frequent species that would give the same level 
of diversity as the data [14]: 

Routledge a diversity is: 



The best-known information function is T(p s ) = — In (p s ). This 
defines the entropy of Shannon [15] . 1(p s ) = ( 1 — p s ) / Ps yields the 
number of species minus 1 and I{p s ) = \—p s , Simpson's [16] 
index. Relative entropy is defined when the information function 
quantifies how different an observed distribution p s is different 
from the expected distribution p' s . The Kullback-Leibler [17] 
divergence is the best-known relative entropy, equal to 
Yl s Ps m (Ps/p's)- Shannon's beta entropy has been shown to be 
the weighted sum of the Kullback-Leibler divergence of local 
communities, where the expected probability distribution of 
species in each local community is that of the meta-community 
[12,18]: 



Xr i> = E w '' E fe ln 



Psi 



(2) 



Combining (3) and (6) yields: 



"D y = {\-{q-\)"H y y-i 



(8) 



We also use the formalism of deformed logarithms, proposed by 
Tsallis [21] to simplify manipulations of entropy. The deformed 
logarithm of order q is defined as: 



ln fl x = 



x l -i-l 



(9) 



It converges to In when q->l. 

The inverse function of ln ? x is the deformed exponential: 



Let us define y as the meta-community's diversity, a as local 
communities' diversities, and /? as diversity between local 
communities. Tsallis y entropy of order q is defined as: 



q H y = (3) 
and the corresponding a entropy in the local community / is: 



q tt _ 1 Xls Psi 
i " 



q-l 



(4) 



cJ = [l+(l- g )xP« 



(10) 



The basic properties of deformed logarithms are: 

ln q (xy)= ln ? x+ In, j>-(#-l)(ln 9 x)(ln 9 y) (11) 



ln fl 



1 



- x q 1 ln 9 x 



(12) 



The natural definition of the total a entropy is the weighted 
average of local community's entropies, following Roudedge [19]: 



L q — e q e q 



(13) 



(5) 



This is the key difference between our decomposition frame- 
work and the non-additive one. Jost [3] proposed another 
definition, q H a = ^2, -( w 1/ Si w l me normalized q- 

expectation of the entropy of communities [20] rather than their 
weighted mean. It is actually a derived result, see the discussion 
below. Our results rely on Routledge's definition (see Appendix 
SI). 



Tsallis entropy can be rewritten as: 
1-M 



q-l 



(14) 



Diversity and Tsallis entropy are transformations of each other: 

(15) 



q H y =\n q q D 7 
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q D, 



(16) 



. Ps 



(23) 



Decomposing diversity of order q 

We start from the multiplicative partitioning of true diversity. 



q D y = q D\Dfj 



(17) 



If community weights are equal, f> diversity is independent of a 
diversity (it is whatever the weights if a diversity is weighted 
according to Jost, but this is not our choice). We will consider the 
unequal weight case later. 

yS diversity is the equivalent number of communities, i.e. the 
number of equally-weighted, non-overlapping communities that 
would have the same diversity as the observed ones. 

We want to explore the properties of entropy decomposition. 
We calculate the deformed logarithm of equation (17): 

In, q D y = In, q D, + In, "Dp -(?-!)( In, »J>„)( In, "Dp) (18) 



9 Hp converges to the Kullback-Leibler divergence when q-*l. 

The average Kullback-Leibler divergence between several 
distributions and their mean is called Jensen-Shannon divergence 
[23], so our /J entropy q Hp can be called generalized Jensen-Shannon 
divergence. It is different from the non-logarithmic Jensen-Shannon 
divergence [24] which measures the difference between the 
equivalent of our a entropy and — Wjp q si ln ? />* (the latter is 
not Tsallis y entropy). 

Our results are summarized in Table 1, including transforma- 
tion of entropy into diversity. The partition of entropy of order q is 
formally similar to that of Shannon entropy. It is in line with Patil 
and Taillie's [14] conclusions: q Hp is the information gain 
attributable to the knowledge that individuals belong to a 
particular community, beyond belonging to the meta-community. 

Information content of generalized entropy 

Both q H Y and q Hp must be rearranged to reveal their 
information function and explicitly write them as entropies. 
Straightforward algebra yields: 



> q H y = *H. + \n q "Dp - (q - 1) (»#„) (In, "Dp) (19) 



q H v 



pi 



r'-i 



q-l 



(24) 



Equation (19) is Jost's partitioning framework (equation 8f in 
[3]). Jost retains Hg= \n q q Dp as the /? component of entropy 
partitioning. It is independent of q H a (they are respective 
transformations of independent q Dp and q D a ), contrarily to the 
/? component of the additive decomposition [10,22] defined as 
q H y - q H x 

After some algebra requiring Roudedge's defmtiion of a diverity 
detailed in Appendix SI, we obtain from equation (19): 



q H.- q H a 



q-\ 



(20) 



The right term of equation (20) is a possible definition of the fS 
component of additive decomposition. It can be much improved if 
we consider YlsPs = YlsP} 1 XL ' w iPsi an d rearrange equation 
(20) to obtain: 



q H v -"H a 



(21) 



We obtained the /? entropy of order q. It is the weighted average 
of the generalized Kullback-Leibler divergence of order q 
(previously derived by Borland et al. [13] in thermostatistics) 
between each community and the meta-community: 



(22) 



?^=E^< 



(25) 



The information functions respectively tend to those of 
Shannon entropy when q— >1. 

Properties of generalized b entropy 

q H[j is not independent of q H x . Only Jost's Hb is an 
independent /$ component of diversity indices. But q Hp takes 
place in a generalized decomposition of entropy. Its limit when 
q->l is Shannon fi entropy, and in this special case only q H/j is 
independent of q H a . 

q H[j is interpretable and self-contained (i.e. it is not just a 
function of y and a entropies): it is the information gain brought by 
the knowledge of each local community's species probabilities 
related to the meta-community's probabilities. It is an entropy, 
defined just as Shannon /? entropy but with a generalized 
information function. 

q Hp is always positive (proof in [25]), so entropy decomposition 
is not limited to equally-weighted communities. 

Bias correction 

Estimation bias (we follow the terminology of Dauby and Hardy 
[26]) is a well-known issue. Real data are almost always samples of 
larger communities, so some species may have been missed. The 
induced bias on Simpson entropy is smaller than on Shannon 
entropy because the former assigns lower weights to rare species, 
i.e. the sampling bias is even more important when q decreases. 

We denote q H the naive estimators of entropy, obtained by 
applying the above formulas to estimators of probabilities (such as 
lHp=^2 s p q si \n q (p S i/p s )). Let q H denote the estimation-bias 
corrected estimators. Chao and Shen's [27] correction can be 
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Table 1. Values of entropy and diversity for generalized entropy of order q and Shannon entropy. 





Diversity measure 


Generalized entropy 


Shannon 


y entropy 


"#,< = -!>.? In,/)., 


// >;/'.!«/', 


entropy 






True y diversity (Hill number) 


"D r = e"> 


1 D v = e H < 


True j] diversity (numbers equivalent) 


<», 


<D f = e' H > 



The deformed logarithm formalism allows presenting all orders of entropy as a generalization of Shannon entropy. Generalized /i entropy is a generalized Kullback- 
Leibler divergence, i.e. the information gain obtained by the knowledge of each community's composition beyond that of the meta-community. Robust estimation of 
the entropy of real communities requires estimation bias correction introduced in the text. 
doi:1 0.1 371 /joumal.pone.0090289.t001 



applied to all of our estimators. It relies on the Horvitz-Thomson 
[28] estimator which corrects a sum of measurements for missing 
species by dividing each measurement by 1 —(1 —/>.„)", i.e. the 
probability for each species to be present in the sample. Next, the 
sample coverage of community i, denoted C,, is the sum of 
probabilities the species of the sample represent in the whole 
community. It is easily estimated [29] from the number of 
singletons (species observed once) of the sample, denoted S\, and 
the sample size : 



C, = l- 



s± 

n> 



(26) 



The sample coverage of the meta-community is estimated the 
same way: C=l— S l /n. An unbiased estimator of />,„ is 
Psi = Cip S i, and p s = Cp s . Combining sample coverage, Horvitz- 
Thomson and equation (23) estimator yields: 



(Cp s f\n q Cp s 
\-{\-Cp s f 



(C h ^ In C ' Psi 

v i-(i-QA„r 



(27) 



(28) 



Another estimation bias has been widely studied by physicists. 
The latter generally consider that all species of a given community 
are known and their probabilities quantified. Their main issue is 
not at all missing species but the non-linearity of entropy measures 
(see [30] for a short review). Probabilities p s are estimated by p s . 
For q>0, estimating pi by is an important source of 

underestimation of entropy. Grassberger [31] derived an unbiased 
estimator p'j. under the assumption that the number of observed 
individuals of a species along successive samplings follows a 
Poisson distribution, as in Fisher's model [32] although arguments 
are different. Grassberger shows that: 



- « V 



r(n, + i) 
r(« s — q+ 1) 



-\)"T(\+q) sin nq 
n(n + l) 



(29) 



where T( ) is the gamma function (T(n) = (n— 1)! if n is an 
integer). Practical computation of T(n s + 1) is not possible for large 
samples so the first term of the sum must be rewritten as: 



T{n„ + \)/T(n s — q+ X) = T(q)/B(n s — q+ \,q) where B is the beta 
function. This estimator can be plugged into the formula of Tsallis 
y entropy to obtain: 



q-\ 



(30) 



Other estimations of p q s are readily detailed here. Holste et al. 
[33] derived the Bayes estimator of p q s (with a uniform prior 
distribution of probabilities not adapted to most biological systems) 
and, recently, Hou et al. [34] derived H r = n/(n— 1)(1 — J2sP 2 s)> 
namely the bias correction proposed by Good [29] and Lande 
[10]. Bonachela et al. [30] proposed a balanced estimator for not 
too small probabilities p s which do not follow a Poisson 
distribution. This may be applied to low-diversity communities. 
In summary, the estimation of p q s requires assumptions about the 
distribution of/>,, and Grassberger's correction is recognized by all 
these authors as the best up-to-date for very diverse communities. 
Better corrections exist but are available for special values of q 
only, such as the recent Chao et a/.'s estimator of Shannon entropy 
[35]. 

The correction for missing species by Chao and Shen and that 
for non-linearity by Grassberger ignore each other. Chao and 
Shen's bias correction is important when q is small and becomes 
negligible for q = 2 while Grassberger's correction increases with q, 
vanishing for q = 0. A rough but pragmatic estimation-bias 
correction is the maximum value of the two corrections. It cannot 
be applied when q<0 (Grassberger's correction is limited to 
positive values of q) neither to /? entropy (Chao and Shen's 
correction can but Grassberger's can't). An estimator of /? entropy 
will be obtained as the difference between unbiased y and a. 
entropy. 

We illustrate this method with a tropical forest dataset already 
investigated by [12]. Two 1-ha plots were fully inventoried in the 
Paracou field station in French Guiana. This results in 1124 
individual trees (diameter at breast height over 10 cm) belonging 
to 229 species. Figure 1 shows diversity values calculated for q 
between 0 and 2, with and without correction. Chao and Shen's 
bias correction is inefficient for q > 1 . 5 and can even be worse than 
the naive estimator. In contrast, Grassberger's correction is very 
good for high values of q, but ignores the missed species and 
decreases when q— »0. The maximum value offers an efficient 
correction. By nature, a and y diversity values decrease with q 
(proof in [36]): around 300 species are estimated in the meta- 
community (q = 0, Figure 1), but the equivalent number of species 
is only 73 for q = 2. 
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0.0 0.5 1.0 1.5 2.0 

Order of Diversity 

Figure 1. Profile of the y diversity in a tropical forest meta- 
community. Data from French Guiana, Paracou research station, 2 ha 
inventoried, 1 124 individual trees, and 229 observed species. Solid line: 
without estimation bias correction; dotted line: Grassberger correction; 
dashed line: Chao and Shen correction. The maximum value is our bias- 
corrected estimator of diversity. 
doi:1 0.1 371 /journal.pone.0090289.g001 

Converting unbiased entropy into diversity introduces a new 
bias issue because of the non-linear transformation by the 
deformed exponential of order q. We follow Grassberger's 
argument: this bias can be neglected because the transformed 
quantity (i.e. the entropy) is an average value (the information) 
over many independent terms, so it has little fluctuations 
(contrarily to the species probabilities whose non-linear transfor- 
mation causes serious biases, as we have seen above). 

We used Barro Colorado Island (BCI) tropical forest data [37] 
available in the vegan package [38] for R [39] to show the 
convergence of the estimators to the real value of diversity. 21457 
trees were inventoried in a 50 hectare plot. They belong to 225 
species. Only 9 species are observed a single time, so the sample 
coverage is over 99.99%. The inventory can be considered as 
almost exhaustive and used to test bias correction. We subsampled 
the BCI community by drawing chosen size samples (from 100 to 
5000 trees) in a multinomial distribution respecting the global 
species frequencies. We drew 100 samples of each size, calculated 
their entropy, averaged it and transformed the result into diversity 
before plotting it in Figure 2. For low values of q, Chao and Shen's 
correction is the most efficient. It is close to the Chaol estimator 
[40] of the number of species for q = 0 (not shown). A correct 
estimation of diversity of order 0.5 is obtained with less than 1000 
sampled trees (around 2 hectares of inventory). When q increases, 
Grassberger bias correction is more efficient: for q= 1.5 and over, 
very small samples allow a very good evaluation. Both corrections 
are equivalent around q = 1.2 (not shown). 

Examples 

Simple, theoretical example 

We first propose a very simple example to visualize the 
decomposition of entropy. A meta-community containing 4 
species is made of 3 communities CI, C2 and C3 with weights 
0.5, 0.25 and 0.25. The number of individuals of each species in 
communities are respectively (25, 25, 40, 10), (70, 20, 10, 0), (70, 
10, 0, 20). The resulting meta-community species frequencies is 
(0.475, 0.2, 0.225, 0.1). Note that community weights do not 
follow the number of individuals (100 in each community). No bias 
correction is necessary since the sample coverage is 1 in all cases. 
Entropy decomposition is plotted in Figure 3. For q = 0, a. and y 
entropy equal the number of species minus 1. The meta- 
community's y entropy is 3, including a entropy equal to 2.5 



(the average number of species minus 1). ft entropy is 0.5, equal to 
the averaged sum of communities contributions. C2's /J entropy is 
negative (the total /? entropy is always positive, but communities 
contributions can be negative). 

Considering Shannon entropy, CI is still the most diverse 
community (4 species versus 3 in C2 and C3, and a more equitable 
distribution: it has the greatest <x entropy equal to 1.29). C2 and 
C3 have the same a. entropy (their frequency distributions are 
identical) equal to 0.8. C3's species distribution is more different 
from the meta-community's than the others: it has the greatest /? 
entropy equal to 0.34. Entropies can be transformed into 
diversities to be interpreted: the a diversity of communities is 
3.6, 2.2 and 2.2 effective species, the total a diversity equals 2.8 
effective species. The meta-community's y diversity is 3.5 effective 
species (quite close to its maximum value 4 if all species were 
equally distributed) and // diversity is 1.2 effective communities: 
the same ft diversity could be obtained with 1.2 theoretical, equally 
weighted communities with no species in common. 

Real data application 

We now want to compare diversity between Paracou and BCI, 
the two forests introduced in the previous section. 

Diversity profiles are a powerful way to represent diversity of 
communities advocated recendy by [36], as a function of the 
importance given to rare species which decreases with q. 
Comparing diversity among communities requires plotting their 
diversity profiles rather than comparing a single index since 
profiles may cross (examples from the literature are gathered in 
[36], Figure 2). Yet, estimation bias depends on the composition of 
communities, questioning the robustness of comparisons: a 
consistent bias correction over orders of entropy is required. 

Entropy is converted to diversity and plotted against q in 
Figure 4 for our two forests: plots are given equal weight since they 
have the same size and gamma diversity is calculated for each 
meta-community. Paracou is more diverse, whatever the order of 
diversity. Bias correction allows comparing very unequally 
sampled forests (2 ha in Paracou versus 50 ha in BCI, sample 
coverage equal to 92% versus 99.99%). 

P diversity profile is calculated between the two plots of 
Paracou. To compare it with BCI which contains 50 1-ha plots, we 
calculated a and /? entropies between all couples of BCI plots, 
averaged them and converted them into f! diversity (a and /? 
entropies are required to calculate /? diversity). We also calculated 
the 95% confidence envelope of P diversity between two 1-ha plots 
of BCI by eliminating the upper and lower 2.5% of the distribution 
of all plot couples /? diversity. We chose to use Chao and Shen's 
correction up to q= 1.2 and Grassberger's correction for greater q 
to obtain comparable results in the 1225 pairs of BCI plots. 
Figure 5 shows Paracou's ft diversity is greater than BCI's, 
especially when rare species are given less importance: for q = 2 
(Simpson diversity), two plots in BCI are as different from each 
other as 1.2 plots with no species in common, while Paracou's 
equivalent number of plots is 1.7. In other words, dominant 
species are very different in Paracou plots, while they are quite 
similar on average between two BCI plots. 

The shape of /? diversity profiles is more complex than that of y 
diversity. At q = 0, [3 diversity equals the ratio between the total 
number of species and the average number of species in each 
community [7] . At q = 1 , it is the exponential of the average 
Kullback-Leibler divergence between communities and the meta- 
community. A minimum is reached between both. Over q = 1 , ft 
diversity increases to asymptotically reach its maximum value 
equal to x 'D y , i.e. the inverse of the probability of the most 
frequent species of the meta-community, divided by °° D a , i.e. the 
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Sample size Sample size 



Figure 2. Efficiency of bias correction. Estimation of diversity of the BCI tropical forest plot for two values of the order of diversity q (a: 0.5, b: 1.5), 
The horizontal line is the actual value calculated from the whole data (around 25000 trees, species frequencies are close to a log-normal distribution). 
Estimated values are plotted against the sample size (100 to 5000 trees). Solid line: naive estimator with no correction; dotted line: Grassberger 
correction; dashed line: Chao and Shen's correction. For q = 0.5, Chao and Shen perform best. For q = 1 .5, Grassberger's correction is very efficient 
even with very small samples. 
doi:1 0.1 371 /journal.pone.0090289.g002 



inverse of the probability of the most frequent species in each 
community. 

Discussion 

Diversity can be decomposed in several ways, multiplicatively, 
additively or non-additively if we focus on entropy. A well-known 
additive decomposition of Simpson entropy is as a variance (that of 
Nei [41] among others). It is derived in Appendix S2. It is not a 
particular case of our generalization: the total variance between 
communities actually equals /? entropy but the relative contribu- 
tion of each community is different. Among these several 
decompositions, only the multiplicative partitioning of equally- 
weighted communities (17) and the non-additive partitioning of 
entropy (19) allow independent a and ft components (except for 
the special case of q = 1), but unequal weights are often necessary 
and ecologists may not want to restrict their studies to Shannon 
diversity. 



We clarify here the differences between non-additive partition- 
ing and our additive decomposition and we address the question of 
unequally-weighted communities. 

Additive versus non-additive decomposition 

Jost [3] focused on independence of the p component of the 
partitioning. He showed (appendix 1 of [3]) that if communities 
are not equally weighted the only definition of q H a allowing 
independence between a and [3 components is 
q Ha= ^,-(wf / J2i w f)lHx- The drawback of this definition is 
that a. may be greater than y entropy if q ^ 1 and community 
weights are not equal. Each component of entropy partitioning 
can be transformed into diversity as a Hill number. 

We have another point of view. We rely on Patil and Taillie's 
concept of diversity of a mixture (section 8.3 of [14]), which 
implies Routledge's definition of a. entropy. It does not allow 
independence between a. and /? components of the decomposition 
except for the special case of Shannon entropy, but it ensures that 




CI C2 C3 Metacommunity CI C2 C3 Metacommunity 



Figure 3. Decomposition of a meta-community entropy. The meta-community is made of three communities named C1, C2 and C3 (described 
in the text). Their a. entropy l \H rl (bottom part of the bars) and their contribution to \i entropy \H$ (top part of the bars) are plotted for ^ = 0 (a) and 
q= 1 (b). The width of bars is each community's weight, a and ji entropies of the meta-community are the weighted sums of those of communities, 
so the area of the rectangles representing community entropies sum to the area of the meta-community's (width equal to 1). y entropy of the meta- 
community is a plus fl entropy. 
doi:1 0.1 371 /journal.pone.0090289.g003 
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Figure 4. Paracou and BCI y diversity. Diversity of the forest 
stations is compared. Solid line: Paracou with bias correction; dotted 
line: Paracou without bias correction; dashed line: BCI with bias 
correction; dotted dashed line: BCI without bias correction. Without bias 
correction, Paracou and BCI diversities appear to be similar for low 
values of q. Bias correction shows that Paracou is undersampled 
compared to BCI (actually around 1000 trees versus 25000). Paracou is 
much more diverse than BCI. 
doi:1 0.1 371 /journal.pone.0090289.g004 

ft entropy is always positive. We believe that independence is not 
essential when dealing with entropy, as it emerges when 
converting entropy to diversity, at least when community weights 
are equal. The /? component of the decomposition cannot be 
transformed into /? diversity without the knowledge of <x entropy 
but we have shown that it is an entropy, justifying the additive 
decomposition of Tsallis entropy. 

The value of fi entropy cannot be interpreted or compared 
between meta-communities as shown by [4] , but combining oc and 
P entropy allows calculating fj diversity (Table 1). 

Unequally weighted communities 

Routledge's definition of a entropy does not allow independence 
between a. and /? diversity when community weights are not equal, 
and diversity can exceed the number of communities [7] . We 
show here that the number of communities must be reconsidered 
to solve the second issue. We consider the independence question 
then. 

We argue that Routledge's definition always allows to reduce 
the decomposition to the equal-weight case. Consider the example 
of Chao et al. [7]: two communities are weighted W\ =0.05 and 
n>2=0.95, their respective number of species are S\ = 100 and 
^2 = 1 0, no species are shared, and we focus on q = 0 for 
simplicity. °D y equal 110 species, °D a is the weighted average of 
Si and S2 equal to 14.5, so Dp is 7.6 effective communities, 
which is more than the actual 2 communities. But this example is 
equivalent to that of a meta-community made of 1 community 
identical to the first one and 19 communities identical to the 
second one, all equally weighted. /? diversity of this 20-community 
meta-community is 7.6 effective communities. 

A more general presentation is as follows. A community of 
weight W can be replaced by any set of n identical communities of 
weights Wi,,,.,W„ provided that the sum of these weights is W, 
without changing a, /? and y diversity of the meta-community 
because of the linearity of Routiedge's definition of entropy. Any 
unequally weighted set of community can thus be transformed into 
an equally weighted one by a simple transformation (strictly 
speaking, if weights are rational numbers). 

Consider a meta-community made of several communities with 
no species in common, and say the smallest one (its weight is 
is the richest (its number if species is S max ). If S max is large enough, 



Figure 5. Paracou and BCI fl diversity, /i diversity profile between 
Paracou plots (solid line) is compared to that of any two plots of BCI 
(dotted line with 95% confidence envelope). 
doi:10.1371/journal.pone.0090289.g005 

the number of species of the meta-community is not much more 
than it (poor communities can be neglected), y richness °D y tends 
to S^, °D a tends to w„,j n S max , so °Dp tends to l/u',„,„. The 
maximum value /? diversity can reach is the inverse of the weight 
of the smallest community: its contribution to a. diversity is 
proportional to its weight, but its contribution to 7 diversity is its 
richness. Given the weights, the maximum value of /? diversity is 
thus \/w m i„; it is the number of communities if weights are equal. 

Comparing ji diversity between meta-communities made of 
different number of communities is not possible without normal- 
ization. Jost [3] suggests normalizing it to the unit interval by 
dividing it by the number of communities in the equal-weight case. 
We suggest extending this solution to dividing /? diversity by 
l/Wmin- When weights are not equal, the number of communities 
is not the appropriate reference. 

Although we could come back to the equally-weighted- 
community partition case, /? diversity is not independent of a 
diversity because communities are not independent of each other 
(some are repeated). Chao et al. (appendix Bl of [7]) derive the 
relation between the maximum value of Dp and 0 D a for a two- 
community meta-community: °Dp< [1 """^ — ]. The 

last term quantifies the relation between a and j8 diversity. It 
vanishes when weights are close to each other, and it decreases 
quickly with 0 D ol . If a diversity is not too low (say 50 species), the 
constraint is negligible fj-Dg can be greater than 0.98/n' m ;;i 
whatever the weights). 

A complete study of the dependence between a and /? diversity 
for all q values and more than two communities is beyond the 
scope of this paper but these first results show that this dependence 
is not so serious a problem as that between oc and /? entropy. As 
long as weights are not too unequal and diversity is not too small, 
results can be interpreted clearly. 

Very unequal weights imply lower /i diversity: the extreme case 
is when the larger community is the richest. If it is large enough, 
the meta-community is essentially made of the largest community 
and °Dp tends to 1 . This is not an issue of the measure, but a 
consequence of the sampling design. 

Conclusion 

The additive framework we proposed here has the advantage of 
generalizing the widely-accepted decomposition of Shannon 
entropy, providing a self-contained definition of p entropy and 
some ways to correct for estimation biases. Deformed logarithms 
allow a formal parallelism between HCDT and Shannon entropy 
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(equations (15) and (16) and Table 1). Of course, diversity can be 
calculated directiy, but no estimation-bias correction is available 
then. The additive decomposition of HCDT entropy can be 
considered empirically as a calculation tool whose results must 
systematically be converted to diversity for interpretation. 

We rely on Routiedge's definition of a entropy which allows 
decomposing unequally-weighted communities and takes place in 
a well-established theoretical framework following Patil and 
Taillie. The price to pay is some dependence between a and /? 
diversity when weights are not equal. It appears to be acceptable 
since it is unlikely to lead to erroneous conclusions. Still, a rigorous 
quantifying of it shall be the object of future research. 

We only considered communities where individuals were 
identified and counted, such as forest inventories. Entropy 
decomposition remains valid when frequencies only are available 
but our bias correction relies entirely on the number of individual: 
other techniques will have to be developed for these communities 
if unobserved species cannot be neglected. Bias correction is still an 
open question. We proposed a first and rough solution. More 
research is needed to combine the available approaches rather 
than using each of them in turn. 



We provide the necessary code for R to compute the analyses 
presented in this paper as a supplementary material in Appendix 
S4 with a short user's guide in Appendix S3. 
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